Abstract. We study optimal trade execution strategies in financial markets with discrete order flow. 
The agent has a finite liquidation horizon and must minimize price impact given a random number of 
incoming trade counterparties. Assuming that the order flow $N$ is given by a Poisson process, we give a 
full analysis of the properties and computation of the optimal dynamic execution strategy. Extensions, 
whereby (a) $N$ is a fully-observed regime-switching Poisson process; and (b) $N$ is a Markov-modulated 
compound Poisson process driven by a hidden Markov chain, are also considered. We derive and compare 
the properties of the three cases and illustrate our results with computational examples. 
1. Introduction 

One of the most important problems faced by a stock trader is how to unwind large block orders of 
security shares. Liquidation of a large position in a security is a challenge due to two factors: (a) possible 
lack of a counterparty; and (b) price impact that depresses prices by increasing supply. This occurs 
because the immediate market resiliency is limited and a single large order may exhaust all current 
buyers, bringing about dramatic price declines. Price impact implies that it is generally beneficial to 
split the order into several smaller blocks and sell each sub-block separately. Presence of counterparties 
is less of a concern in traditional limit order book markets where a market maker is always quoting 
a price. However, trading in such markets may be disadvantageous due to information leak/privacy 
concerns. Indeed, by examining the order book, other participants may recognize the large trader and 
move against her, even if she attempts to split her trades. Thus, a recent trend involves trading in dark 
pool markets where there is no order book and buyers/sellers are matched up electronically without 
revealing any information. Such dark trades minimize information leakage and dramatically reduce risk 
of adverse price movement compared to conventional limit book trading. However, liquidity becomes a 
major concern as there is no market-maker and no counterparty may be forthcoming. We refer to trade 
publications such as QPL Newsletter [2008] for more information on the evolving marketplace of dark 
pools and their numerous specification variations. 

In this paper, we propose a new framework that explicitly takes into account such liquidity features 
of large order trades. Thus, we replace the classical continuous trading environment with a discrete 
order book. In our model, incoming buy orders are represented by a Poisson process which encodes the 
order arrival times. To capture the empirical feature of splitting large orders into smaller pieces, we 
will focus on price impact and eschew consideration of actual prices. Larger trades involve a volume 
discount and therefore tend to carry higher spread versus the current quoted limit order price. Also, 
smaller trades are desirable in order to maintain anonymity and mitigate information leaks. Subject to 
the constraint that trades are only possible at order times, the objective of the agent is to execute her 
large order trade within a specified time window while minimizing this price impact. 

Most of the existing analysis of optimal execution has focused on limit order book markets, see e.g. 
Alfonsi et al. [2007], Almgren [2003], Almgren and Lorenz [2006], Obizhaeva and Wang [2006], Schied 
and Schöneborn [2008a, b]. Since a market maker is always present, all cited models assume a 
continuous-time trading environment, with the asset price usually represented by a diffusion process. The price 
impact is decomposed into temporary and permanent effects and execution strategies are specified in 
terms of liquidation rates per unit time. The overall problem is then translated into a continuous or 
singular stochastic control formulation. Our approach is quite different, as in our case all trades are 
discrete and therefore an execution strategy corresponds to an impulse control setting. Also, in the 
above literature the optimal liquidation strategies turn out to be deterministic and can be sometimes 
explicitly determined. In contrast, our optimal strategies are intrinsically path-dependent and will be 
affected by the stochastic order flow. Finally, while the above papers typically consider an infinite 
horizon, we assume that the agent has a hard deadline to liquidate her large trade. Thus, time-to-maturity 
is a crucial variable in our setup and can also be used to express time-dependencies of real 
markets, where e.g. the opening and closing hours are typically much more liquid than midday. To 
sum up, our contribution is a new approach to modeling order execution liquidity in terms of point 
processes. As we show below, our models are flexible, allow for a quick implementation and admit 
fruitful probabilistic analysis. 

Let us now outline the basic ingredients of our model. We assume that the order book is a Poisson 
process $N$ with arrival times $\sigma_i$, which denote the timestamp of the $i$-th order. In our base model we 
postulate that $N$ is a simple Poisson process with constant intensity $\lambda$ on a stochastic basis $(\Omega, \mathcal{F}, \mathbb{P})$. 
Suppose the agent has $k$ shares (or units) to sell and an execution horizon of $T$ time epochs. We 
postulate that at terminal date $T$ all unsold units are immediately disposed of as one large trade, e.g. 
through the traditional limit order book. Thus, effectively there is always one more matching order 
arriving at $T$. The price impact is represented in terms of a strictly increasing and strictly convex market 
depth function $F$, where $F(a)$ represents the cost of placing a trade of size $a$ ($F(a)$ could also represent 
the average cost of a random price impact, assuming this randomness is independent of everything else 
in the model). 

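Let $\mathbb{F} = (\mathcal{F}_t)$, with $\mathcal{F}_t = \sigma(N_s : 0 \le s \le t)$, be the filtration generated by the observation process. Then 
the optimization problem of the agent can be written as 

\[
v(k,T) = \inf_{\xi \in \mathcal{A}_k} \mathbb{E}\Big[ \sum_{i : \sigma_i \le T} F(\xi_{\sigma_i-} - \xi_{\sigma_i}) + F(\xi_T) \Big] =: \inf_{\xi \in \mathcal{A}_k} v^{\xi}(k,T), \qquad k \in \mathbb{N}_+,\ T \in \mathbb{R}_+, \tag{1.1}
\]

where $\mathcal{A}_k$ is the set of all $\mathbb{F}$-adapted, integer-valued, positive and non-increasing processes $\xi$ whose values 
change only at the time of jumps of the Poisson process $N$, with $\xi_0 = k$. The convexity of $F$ is interpreted 
as the limited market resiliency and encourages the agent to split the large $k$-order into smaller pieces. 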
However, placing a smaller trade now is risky as no more orders might come in and the trader will be 
left with a large leftover at T (which will carry a large associated penalty). Thus, the convexity of F 
also represents the impatience of the agent in terms of current versus future trading and is formally 
similar to the risk-aversion level in Schied and Schöneborn [2008a, b]. 

In terms of the stochastic control formulation, (1.1) is related to best choice problems with Poisson 
processes, see e.g. Cowan and Zabczyk [1978], Bruss [1987]. In particular, Stadje [1987, 1990] studied a 
similar problem for a Poisson process in the context of multi-item dynamic pricing. 

The mathematical problem in (1.1) is a compromise between a tractable analytical model and real 
markets. In general, the execution problem with illiquid trading is not so well-studied and a big challenge 
is to develop parsimonious models that will prescribe reasonable optimal liquidation policies. The use 
of a Poisson process for $N$ allows for a comprehensive analysis of (1.1) in Section 2; however, it is clearly 
not rich enough to capture all the intricacies of real order books. Accordingly, we consider in Section 3 
several extensions to address such issues. Our base model allowed arbitrary trade sizes; in practice the 
agent is only able to trade up to the order size which is the second dimension of the order flow. To 
reproduce this feature, in Section 3.1 we take $N$ to be a compound Poisson process, consisting of pairs 
$(\sigma_i, Y_i)$ of (order times, order sizes). Correspondingly, the original problem (1.1) is modified to constrain 
$\xi_{\sigma_i-} - \xi_{\sigma_i} \le Y_i$. Because of this constraint, the agent is expected to preemptively place larger orders in 
case a large matching order is forthcoming. 

Second, the base model assumed that the intensity of $N$ was constant throughout the problem horizon. 
Given widespread evidence that real markets experience different liquidity regimes, in Section 3.2 we 
extend our model to the case where $N$ is a Markov-modulated Poisson process. Thus, we will assume 
$\lambda = \lambda(M_t)$ where $M$ is an (observed) independent Markov chain that describes the liquidity state of 
the market. Similarly, the distribution of sizes $Y_i$ will also be modulated by $M$. In Section 3.3 we 
then consider the even more realistic (and more complex) situation where traders do not observe $M$. 
Indeed, market participants do not know the current market liquidity and dynamically infer it given 
matched dark pool order flow. To capture this phenomenon, in Section 3.3 we will assume that the 
liquidity regime is modeled by a hidden Markov chain $M$ that modulates the intensity and the jump 
distribution of crossing orders. To illustrate the different models mentioned above, Section 4 presents 
several computational examples; finally, Section 5 concludes and points to possible future extensions. 



2. Analysis of the Optimal Liquidation Problem 



In this section we analyze the properties of $v(k,T)$ as defined in (1.1). The treatment below gives 
a clear insight into the structure of $v(k,T)$ and leads to a particularly simple algorithm to 
compute $v$ and the associated optimal strategy, see Remark 2.1. Our first observation is that $v(k,T)$ 
satisfies the following dynamic programming equation: 

\[
v(k,T) = \mathbb{E}\Big[ \min_{a \in \{1,\ldots,k\}} \big( v(k-a, T-\sigma_1) + F(a) \big) \cdot 1_{\{\sigma_1 \le T\}} + F(k) \cdot 1_{\{\sigma_1 > T\}} \Big]. \tag{2.1}
\]

A more general version of this dynamic programming principle is proved in Proposition 3.2. 



2.1. Computing $v(k,T)$. To illustrate the problem, let us explicitly compute $v(k,T)$ for a few values 
of $k$. First, for $k = 1$ we trivially have $v(1,T) = F(1)$, as one can simply wait till $T$ to make the single 
unit trade. Since $F$ is strictly convex, when there are two units to sell, it is clearly optimal to try to 
place two orders of size one. This will be possible as long as there is at least one arrival before date $T$, 
i.e. $N(T) \ge 1$ (recall that the remainder can always be disposed of at $T$). Applying (2.1) and recalling 
the properties of the Poisson process yields 

\[
v(2,T) = 2F(1)\,(1 - e^{-\lambda T}) + F(2)\, e^{-\lambda T}.
\]

When $k = 3$, the agent needs to sell three units. Once an incoming order arrives, the agent should 
trade one unit, as getting rid of two or three units is not optimal (in the worst case, she will sell one 
unit now and the remaining two at $T$). Conditioning on the time $\sigma_1$ of the first order, and using (2.1), 
her expected minimal cost is then 

\[
\begin{aligned}
v(3,T) &= F(3)\,\mathbb{P}(\sigma_1 > T) + \mathbb{E}\big[ (F(1) + v(2, T-\sigma_1))\, 1_{\{\sigma_1 \le T\}} \big] \\
&= F(3)\, e^{-\lambda T} + \int_0^T \big( F(1) + v(2, T-s) \big)\, \lambda e^{-\lambda s}\, ds \\
&= e^{-\lambda T} F(3) + \lambda T e^{-\lambda T} F(2) + \big( 3 - 3e^{-\lambda T} - 2\lambda T e^{-\lambda T} \big) F(1).
\end{aligned}
\]

The case $k = 4$ is the first non-trivial case. Indeed, the agent can sell either one or two units when there 
is an incoming order (other strategies are clearly not optimal). This decision will be based on whether 
$v(3,T-\sigma_1) + F(1) > v(2,T-\sigma_1) + F(2)$ at the first arrival time $\sigma_1$. If the latter inequality is true, then 
one is better off selling two units; otherwise a single unit is optimal to trade. Observe that both sides of 
the last inequality can be explicitly computed using previous formulas. From this computation it can 
be observed that as time to maturity, $T$, becomes smaller the agent gets more impatient and trades two 
units instead of one as soon as there is an arrival. Thus, there exists a critical threshold $T^{(4,2)}$ such that 
if $T - \sigma_1 > T^{(4,2)}$ then it is optimal to trade just one unit, and if $T - \sigma_1 < T^{(4,2)}$ then it is optimal to 
trade two units. 
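The closed-form expressions above can be checked numerically. The following is a minimal sketch, not part of the original analysis: it evaluates the formulas for $v(2,T)$ and $v(3,T)$ and locates the threshold for $k = 4$ by bisection on the difference between the two candidate costs. The depth function $F(a) = a^2$ used in the usage note, the search bracket `hi`, and all function names are illustrative assumptions; the bisection also assumes the cost difference changes sign exactly once on the bracket.

```python
import math

def v2(T, lam, F):
    """Closed form for v(2,T) from the text."""
    p0 = math.exp(-lam * T)  # probability of no arrival before T
    return 2 * F(1) * (1 - p0) + F(2) * p0

def v3(T, lam, F):
    """Closed form for v(3,T) from the text."""
    p0 = math.exp(-lam * T)
    return p0 * F(3) + lam * T * p0 * F(2) + (3 - 3 * p0 - 2 * lam * T * p0) * F(1)

def gap(T, lam, F):
    """Cost of selling one unit minus cost of selling two, with 4 units left."""
    return (v3(T, lam, F) + F(1)) - (v2(T, lam, F) + F(2))

def threshold_4_2(lam, F, hi=50.0, tol=1e-10):
    """Bisection for the horizon at which one-unit and two-unit trades tie.

    Assumes gap > 0 at T = 0, gap < 0 at T = hi, and a single sign change.
    """
    lo = 0.0
    if gap(lo, lam, F) <= 0:
        return 0.0  # trading one unit is already optimal at T = 0
    if gap(hi, lam, F) > 0:
        raise ValueError("no sign change on [0, hi]; enlarge hi")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if gap(mid, lam, F) > 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)
```

For example, `threshold_4_2(1.0, lambda a: float(a * a))` returns the horizon below which, under these assumed parameters, two units are sold at the first arrival.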

Let $a(k,T)$ be the optimal order size to place when an order arrives given that one has $k$ units 
remaining and $T$ epochs until the terminal date. Then the above analysis shows that $a(1,T) = a(2,T) = 
a(3,T) = 1$ for all $T > 0$, while $a(4,T) = 1 + 1_{\{T < T^{(4,2)}\}}$. In general, it follows from (2.1) that 

\[
a(k,T) = \arg\min_{a \in \{1,\ldots,k\}} \{ v(k-a, T) + F(a) \}. \tag{2.2}
\]

The above equation is simply the dynamic programming principle that says that the best immediate 
action is to sell $a$ units, such that the sum of the current cost $F(a)$ and expected future costs as 
represented by the value function $v(k-a,T)$ is minimized. To avoid ambiguity, we will assume that if 
the minimizer in (2.2) is not unique, then $a(k,T)$ is the smallest minimizer. 

We conclude this section with upper and lower bounds for $v(k,T)$. The next lemma gives an 
easy-to-compute lower bound for the value function. Below, we extend the domain of $F$ to the whole positive 
real line such that $F : \mathbb{R}_+ \to \mathbb{R}_+$ is still strictly convex and increasing. 

Lemma 2.1. We have 

\[
v(k,T) \ge \sum_{n < k-1} (n+1)\, F\Big( \frac{k}{n+1} \Big)\, \mathbb{P}(N(T) = n) + k F(1)\, \mathbb{P}(N(T) \ge k-1) =: \underline{v}(k,T). \tag{2.3}
\]

Proof. Consider a genie who is affected by the randomness but for each state of the world can tell how 
many arrivals there will be. Let us assume also that the genie is allowed to divide up her orders into 
non-integral bits of size $\ge 1$. Then, conditional on knowing $N(T) = n < k-1$, the genie should execute 
$n+1$ trades of size $k/(n+1)$ (the last trade comes at the period close). Consequently, the right hand 
side of (2.3) is the genie's solution to (1.1), which is clearly better than the optimal solution of the 
mortal, who does not possess any clairvoyance about $N$ and can only divide up her blocks into integral 
units. □ 
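The genie's cost described in the proof is straightforward to evaluate. The sketch below is an illustration under stated assumptions rather than the authors' code: it sums the cost of $n+1$ equal trades of size $k/(n+1)$ over the Poisson distribution of $N(T)$ and charges $kF(1)$ once at least $k-1$ arrivals occur. The function names and the quadratic $F$ in the usage note are hypothetical, and $F$ must accept non-integer arguments.

```python
import math

def poisson_pmf(n, mu):
    """P(N = n) for a Poisson random variable with mean mu."""
    return math.exp(-mu) * mu ** n / math.factorial(n)

def genie_lower_bound(k, T, lam, F):
    """Expected cost of the clairvoyant 'genie' from the proof of Lemma 2.1."""
    mu = lam * T
    total = 0.0
    prob_few = 0.0  # accumulates P(N(T) < k - 1)
    for n in range(k - 1):  # n = 0, ..., k-2 arrivals
        p = poisson_pmf(n, mu)
        total += (n + 1) * F(k / (n + 1)) * p  # n+1 equal trades
        prob_few += p
    # With at least k-1 arrivals the genie sells k single units.
    total += k * F(1) * (1.0 - prob_few)
    return total
```

For $k = 2$ the bound coincides with the closed form $2F(1)(1 - e^{-\lambda T}) + F(2)e^{-\lambda T}$, since with two units the genie has no advantage over the agent.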
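As a counterpart to Lemma 2.1, we have the following tight upper bound on the value function. 
Lemma 2.2. 

\[
v(k,T) \le \min_{c} v^{c}(k,T) = \min_{c \in \{1,\ldots,k\}} \Big\{ \Big( \Big\lfloor \frac{k}{c} \Big\rfloor F(c) + F\Big( k - c \Big\lfloor \frac{k}{c} \Big\rfloor \Big) \Big) \cdot \mathbb{P}\Big( N(T) \ge \Big\lfloor \frac{k}{c} \Big\rfloor \Big) + \sum_{n=0}^{\lfloor k/c \rfloor - 1} \big( n \cdot F(c) + F(k - n \cdot c) \big)\, \mathbb{P}(N(T) = n) \Big\}, \tag{2.4}
\]

in which $\lfloor x \rfloor$ is the largest integer not exceeding $x$. 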

Proof. The right hand side of (2.4) is the cost of a constant $c$-strategy. This is the strategy where the 
agent insists on trading $c$ units at each arrival time until terminal date $T$, at which point the remainder is 
liquidated. Although she originally optimizes over $c$, clearly this is a sub-optimal strategy. The bound 
in (2.4) becomes tight as $T \to \infty$: the liquidity risk vanishes and the optimal strategy is to always trade 
a single unit, $c^* = 1$. □ 
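The constant $c$-strategy from the proof can be costed directly. The sketch below is a hedged illustration: it computes the expected cost of always trading $c$ units per arrival and dumping the remainder at $T$, then minimizes over $c$. It assumes $F(0) = 0$ (so that an empty terminal trade is free); the function names and the choice $F(a) = a^2$ in the usage note are assumptions, not taken from the paper.

```python
import math

def poisson_pmf(n, mu):
    """P(N = n) for a Poisson random variable with mean mu."""
    return math.exp(-mu) * mu ** n / math.factorial(n)

def constant_c_cost(k, c, T, lam, F):
    """Expected cost of selling c units per arrival, remainder at T.

    Assumes F(0) == 0 so that a zero-size terminal trade costs nothing.
    """
    m = k // c            # number of full c-sized trades available
    mu = lam * T
    cost = 0.0
    prob_few = 0.0        # accumulates P(N(T) < m)
    for n in range(m):    # fewer than m arrivals: n trades plus a large remainder
        p = poisson_pmf(n, mu)
        cost += (n * F(c) + F(k - n * c)) * p
        prob_few += p
    # At least m arrivals: m trades of size c and the leftover k - m*c at T.
    cost += (m * F(c) + F(k - m * c)) * (1.0 - prob_few)
    return cost

def constant_strategy_upper_bound(k, T, lam, F):
    """Best constant-c strategy, an upper bound on the value function."""
    return min(constant_c_cost(k, c, T, lam, F) for c in range(1, k + 1))
```

With `F = lambda a: float(a * a)`, the bound for $k = 2$ equals the closed form for $v(2,T)$, and for a long horizon it approaches $kF(1)$, in line with the remark on tightness in the proof.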

2.2. Properties of the value function. We now present a series of lemmas that describe the 
properties of $v$ and $a$. This section then culminates with Proposition 2.1, which summarizes our analysis 
below. 

In parallel with the original formulation in (1.1) in terms of dynamic controls $\xi$, one may also describe 
Markov control strategies as $\{ b(k,T) : k \in \mathbb{N}, T \in \mathbb{R}_+ \}$ specifying the trading amount conditional on 
still having $k$ units left with time horizon of $T$ periods. Given such $\{b(k,T)\}$, the corresponding dynamic 
unit inventory process is denoted by $\xi^{(b,k,T)}$ and satisfies 

\[
d\xi^{(b,k,T)}_t = -b\big( \xi^{(b,k,T)}_{t-},\, T - t \big)\, dN_t, \qquad \xi^{(b,k,T)}_0 = k. \tag{2.5}
\]

Economically, $\xi^{(b,k,T)}_t$ represents the remaining number of units at date $t$ when employing the execution 
strategy $b$. Using $a$ to denote the strategy characterized by (2.2), it follows that an optimal inventory 
process for (1.1) is given by $\xi^* = \xi^{(a,k,T)}$. In particular, an optimal control is of the Markovian feedback 
type. 
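The inventory dynamics under a Markov strategy lend themselves to simulation. The following is a minimal Monte Carlo sketch under stated assumptions: arrivals are generated from exponential inter-arrival times with rate `lam`, the inventory drops by `b(inventory, time_remaining)` at each arrival, and any leftover is sold at the terminal date. The names `simulate_cost` and `estimate_value`, the seed, and the example policy are hypothetical, and the policy is assumed to return an integer.

```python
import random

def simulate_cost(b, k, T, lam, F, rng):
    """Realized cost of one path when following the Markov strategy b."""
    t = 0.0
    inventory = k
    cost = 0.0
    while inventory > 0:
        t += rng.expovariate(lam)       # next Poisson arrival time
        if t > T:
            break                       # no further arrivals before the deadline
        trade = min(b(inventory, T - t), inventory)
        cost += F(trade)
        inventory -= trade
    if inventory > 0:
        cost += F(inventory)            # leftover liquidated as one block at T
    return cost

def estimate_value(b, k, T, lam, F, n_paths=20000, seed=0):
    """Monte Carlo estimate of the expected cost of strategy b."""
    rng = random.Random(seed)
    total = sum(simulate_cost(b, k, T, lam, F, rng) for _ in range(n_paths))
    return total / n_paths
```

For instance, `estimate_value(lambda k, t: 1, 2, 1.0, 1.0, lambda a: float(a * a))` estimates the cost of always selling a single unit with two units in hand, which can be compared with the closed form for $v(2,T)$.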

The following lemma immediately follows from the definition of the value function in (1.1). 

Lemma 2.3. The function $k \mapsto v(k,T)$ is increasing and the function $T \mapsto v(k,T)$ is decreasing. 

Proof. The above results are model-free in the sense that they depend solely on the convexity of $F$ and 
not on any properties of the arrival process $N$. Thus, it is instructive to give a short proof. Let $\xi$ be any 
admissible control for $v(k,T)$. Then $\xi$ is also admissible for $v(i,T)$ for any $i > k$, which immediately 
establishes the first part of the lemma. Moreover, for any $T' > T$, define a control $\xi'$ via $\xi'_t = \xi_t$ for 
$t \le T$ and $\xi'_{\sigma_i-} - \xi'_{\sigma_i} = 1_{\{\xi'_{\sigma_i-} > 0\}}$ for $T < \sigma_i \le T'$. Then $\xi'$ is an admissible control for $v(k,T')$. Moreover, 
due to the convexity of $F$, the pathwise cost of $\xi'$ is less than or equal to the pathwise cost of $\xi$, 

\[
\sum_{i : \sigma_i \le T} F(\xi'_{\sigma_i-} - \xi'_{\sigma_i}) + \sum_{j : T < \sigma_j \le T'} F(\xi'_{\sigma_j-} - \xi'_{\sigma_j}) + F(\xi'_{T'}) \le \sum_{i : \sigma_i \le T} F(\xi_{\sigma_i-} - \xi_{\sigma_i}) + F(\xi_T),
\]

with strict inequality if $\xi_T > 1$ and $N(T') - N(T) > 0$. It follows that $v^{\xi'}(k,T') \le v^{\xi}(k,T)$ with strict 
inequality as long as $\mathbb{P}(N(T) = 0,\ N(T') - N(T) > 0) > 0$. Note that the last statement is satisfied for 
$N$ a Poisson process and any $T < T'$. □ 

The following basic lemma shows that the slope of v is smaller than that of F. 

Lemma 2.4. For any $k_1 > k_2$ and $T$ we have $v(k_1,T) - v(k_2,T) \le F(k_1) - F(k_2)$. Alternatively, 
$F(k) - v(k,T)$ is increasing in $k$. 

Proof. Let $\xi^{k_2}$ denote $\xi^{(a,k_2,T)}$. Recall that $v^{\xi}(k,T)$ denotes the expected performance of any control 
$\xi$. Interpreting $\xi^{k_2}$ as a sub-optimal control for $v(k_1,T)$ (which disposes of the extra $k_1 - k_2$ units at 
maturity), we have 

\[
\begin{aligned}
v(k_1,T) - v(k_2,T) &\le v^{\xi^{k_2}}(k_1,T) - v^{\xi^{k_2}}(k_2,T) \\
&= \mathbb{E}\Big[ \sum_{i=0}^{k_2} \big( F(i + k_1 - k_2) - F(i) \big)\, 1_{\{\xi^{k_2}_T = i\}} \Big] \\
&\le \mathbb{E}\Big[ \sum_{i=0}^{k_2} \big( F(k_1) - F(k_2) \big)\, 1_{\{\xi^{k_2}_T = i\}} \Big] = F(k_1) - F(k_2),
\end{aligned}
\]

where the second inequality follows from the convexity of $F$, whereby $F(a+y) - F(y)$ is increasing in 
$y$. □ 

The following lemma shows that if one starts with more units initially and sells them in an optimal 
way, then one will always have more units at any later point in time (an intuitive observation). 

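Lemma 2.5. Let $\xi^{k}$ denote $\xi^{(a,k,T)}$, $k \in \mathbb{N}_+$. Then for $\ell > k$ we have that $\xi^{\ell}_t \ge \xi^{k}_t$ for all $t \in [0,T]$. 

Proof. First note that if at any date $s \le t$ we would have $\xi^{\ell}_s = \xi^{k}_s$ then it follows from (2.5) and the 
Markov nature of $a(k,T)$ that for all $s' > s$ we will have $\xi^{\ell}_{s'} = \xi^{k}_{s'}$ as well. Thus, to have $\xi^{\ell}_t < \xi^{k}_t$ on a set $A$ 
of strictly positive probability there necessarily must be an arrival $\sigma_j$ such that $d_\ell := \xi^{\ell}_{\sigma_j-} > \xi^{k}_{\sigma_j-} =: d_k$ 
and $b_\ell := \xi^{\ell}_{\sigma_j} < \xi^{k}_{\sigma_j} =: b_k$ on $A$. By construction, $b_\ell = d_\ell - a(d_\ell, T - \sigma_j)$ and $b_k = d_k - a(d_k, T - \sigma_j)$. 
Moreover, 

\[
\begin{aligned}
a(d_\ell, T - \sigma_j) &=: a_\ell = \arg\min_{a} \{ v(d_\ell - a, T - \sigma_j) + F(a) \}; \\
a(d_k, T - \sigma_j) &=: a_k = \arg\min_{a} \{ v(d_k - a, T - \sigma_j) + F(a) \}.
\end{aligned} \tag{2.6}
\]

Define $c_\ell = d_\ell - d_k + a_k > a_k$, and $c_k = d_k - d_\ell + a_\ell < a_\ell$. Therefore from (2.6) (and recalling that 
$a_\ell$ is the smallest minimizer, while $a_\ell > c_\ell$) 

\[
\begin{cases}
v(d_\ell - a_\ell, T - \sigma_j) + F(a_\ell) < v(d_\ell - c_\ell, T - \sigma_j) + F(c_\ell), \\
v(d_k - a_k, T - \sigma_j) + F(a_k) \le v(d_k - c_k, T - \sigma_j) + F(c_k).
\end{cases}
\]

Re-arranging, we obtain 

\[
\begin{aligned}
v(d_\ell - c_\ell, T - \sigma_j) - v(d_\ell - a_\ell, T - \sigma_j) &> F(a_\ell) - F(c_\ell), \\
v(d_k - a_k, T - \sigma_j) - v(d_k - c_k, T - \sigma_j) &\le F(c_k) - F(a_k).
\end{aligned} \tag{2.7}
\]
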

8 ERHAN BAYRAKTAR AND MICHAEL LUDKOVSKI 

However, the left-hand sides of both inequalities in (2.7) are the same by construction and are in fact 
equal to $v(b_k, T - \sigma_j) - v(b_\ell, T - \sigma_j) \ge 0$. On the other hand, since $d_\ell > d_k$ and $c_\ell > a_k$, while 

\[
a_\ell - c_\ell = c_k - a_k = a_\ell - a_k + d_k - d_\ell = b_k - b_\ell > 0,
\]

by the convexity of $F$ we must have $F(a_\ell) - F(c_\ell) \ge F(c_k) - F(a_k)$, contradicting (2.7). □ 

The above lemma implies the following useful corollary regarding optimal actions for different inven- 
tory levels. 

Corollary 2.6. For any $T \in \mathbb{R}_+$ and $\ell < k$, we have $a(k,T) - a(\ell,T) \le k - \ell$. In particular, 
$a(k+1,T) \le a(k,T) + 1$. 

Corollary 2.6 follows from Lemma 2.5 since the given relation between optimal actions is necessary 
to keep the corresponding inventory processes ordered correctly. 

Lemma 2.7. We have that $v(k,T)$ is "convex" in $k$, that is, for any $k \in \mathbb{N}_+$ 

\[
v(k,T) - v(\ell,T) \ge v(k-n,T) - v(\ell-n,T), \qquad \forall \ell \in \{1,\ldots,k\},\ \forall n \in \{1,\ldots,\ell\}. \tag{2.8}
\]

Also, for any $T \in \mathbb{R}_+$ and $\ell \le k$, 

\[
a(\ell,T) \le a(k,T). \tag{2.9}
\]

Proof. We will prove both of the above statements together by induction. Note that (2.8) holds when 
$k = 1$ since $v(0,T) = 0$. Also $a(1,T) \ge a(0,T) = 0$. Suppose that (2.8) and (2.9) hold for some $k \ge 1$. 
We will show that they are also true when $k$ is replaced by $k+1$. It is enough to prove that 

\[
v(k+1,T) - v(k,T) \ge v(k,T) - v(k-1,T), \tag{2.10}
\]

and that $a(k+1,T) \ge a(k,T)$. 

First, by definition $a(k+1,T) = \arg\min_{a} \{ v(k+1-a,T) + F(a) \}$. Now suppose that $a(k,T) > b \ge 1$. 
This implies that 

\[
\begin{aligned}
& v(k-b,T) + F(b) > v(k - a(k,T),T) + F(a(k,T)) \\
\Rightarrow\ & v(k-b,T) - v(k - a(k,T),T) > F(a(k,T)) - F(b) \\
\Rightarrow\ & v(k+1-b,T) - v(k+1-a(k,T),T) > F(a(k,T)) - F(b) \\
\Rightarrow\ & a(k+1,T) \ne b,
\end{aligned}
\]

since the sale of $b$ shares is less preferable than selling $a(k,T)$ shares. The third line follows from the 
induction hypothesis since $k+1-b \le k$. Since $a(k+1,T) \ne b$ for any $b < a(k,T)$ we necessarily have 
that $a(k+1,T) \ge a(k,T)$. 

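Thanks to the fact that $a(k+1,T) \ge a(k,T)$ for all $T \in \mathbb{R}_+$, the induction hypothesis on $a$, and 
the dynamics of $\xi^* = \xi^{(a,k,T)}$ given in (2.5), we have that $\xi^{k+1}_{\sigma_n-} - \xi^{k+1}_{\sigma_n} = \xi^{k}_{\sigma_n-} - \xi^{k}_{\sigma_n} + \Delta_{\sigma_n}$, where 
$\Delta_{\sigma_n} \in \{0,1\}$ ($\Delta_{\sigma_n} \le 1$ due to Corollary 2.6). The process $\Delta$ should be thought of as the "additional" 
action needed to sell one more unit starting with $k$ units. Now, the left-hand side of (2.10) becomes 

\[
\begin{aligned}
v(k+1,T) - v(k,T) &= \mathbb{E}\Big[ \sum_{n : \sigma_n \le T} 1_{\{\Delta_{\sigma_n} > 0\}} \big\{ F(\xi^{k+1}_{\sigma_n-} - \xi^{k+1}_{\sigma_n}) - F(\xi^{k}_{\sigma_n-} - \xi^{k}_{\sigma_n}) \big\} + 1_{\{\sum_n \Delta_{\sigma_n} = 0\}} \big\{ F(\xi^{k+1}_T) - F(\xi^{k}_T) \big\} \Big] \\
&= \mathbb{E}\Big[ \sum_{n : \sigma_n \le T} 1_{\{\Delta_{\sigma_n} > 0\}} \big\{ F(\xi^{k}_{\sigma_n-} - \xi^{k}_{\sigma_n} + 1) - F(\xi^{k}_{\sigma_n-} - \xi^{k}_{\sigma_n}) \big\} + 1_{\{\sum_n \Delta_{\sigma_n} = 0\}} \big\{ F(\xi^{k}_T + 1) - F(\xi^{k}_T) \big\} \Big].
\end{aligned} \tag{2.11}
\]

Let us analyze the right-hand side of (2.10). Define the control $\xi'$ by $\xi'_0 = k$ and $\xi'_{\sigma_n-} - \xi'_{\sigma_n} = 
\xi^{k-1}_{\sigma_n-} - \xi^{k-1}_{\sigma_n} + \Delta_{\sigma_n}$. This is an admissible control for selling $k$ units. Then, 

\[
\begin{aligned}
v(k,T) - v(k-1,T) &\le \mathbb{E}\Big[ \sum_{n : \sigma_n \le T} F(\xi'_{\sigma_n-} - \xi'_{\sigma_n}) + F(\xi'_T) \Big] - \mathbb{E}\Big[ \sum_{n : \sigma_n \le T} F(\xi^{k-1}_{\sigma_n-} - \xi^{k-1}_{\sigma_n}) + F(\xi^{k-1}_T) \Big] \\
&= \mathbb{E}\Big[ \sum_{n : \sigma_n \le T} 1_{\{\Delta_{\sigma_n} > 0\}} \big\{ F(\xi^{k-1}_{\sigma_n-} - \xi^{k-1}_{\sigma_n} + 1) - F(\xi^{k-1}_{\sigma_n-} - \xi^{k-1}_{\sigma_n}) \big\} + 1_{\{\sum_n \Delta_{\sigma_n} = 0\}} \big\{ F(\xi^{k-1}_T + 1) - F(\xi^{k-1}_T) \big\} \Big] \\
&\le \mathbb{E}\Big[ \sum_{n : \sigma_n \le T} 1_{\{\Delta_{\sigma_n} > 0\}} \big\{ F(\xi^{k}_{\sigma_n-} - \xi^{k}_{\sigma_n} + 1) - F(\xi^{k}_{\sigma_n-} - \xi^{k}_{\sigma_n}) \big\} + 1_{\{\sum_n \Delta_{\sigma_n} = 0\}} \big\{ F(\xi^{k}_T + 1) - F(\xi^{k}_T) \big\} \Big] \\
&= v(k+1,T) - v(k,T).
\end{aligned}
\]

The last inequality is by the convexity of $F$ and the induction hypothesis on $a$, from which it follows 
that $\xi^{k-1}_{\sigma_n-} - \xi^{k-1}_{\sigma_n} \le \xi^{k}_{\sigma_n-} - \xi^{k}_{\sigma_n}$. The last equality is from (2.11). This completes the proof. □ 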

To better connect Lemma 2.7 with the notion of convexity, we state the following corollary: 
Corollary 2.8. Fix $a, k \in \mathbb{N}$ with $a < k$. Then for any $b \in \mathbb{N}_+$ with $a < b < k$ we have that 

\[
v(k-a-1,T) \le \alpha\, v(k-b,T) + (1-\alpha)\, v(k-a,T), \tag{2.12}
\]

in which $\alpha = 1/(b-a)$. 

Proof. We will prove this statement by induction. Note that (2.12) or, equivalently, 

\[
v(k-a,T) - v(k-b,T) \le (b-a)\,[\, v(k-a,T) - v(k-a-1,T) \,] \tag{2.13}
\]

holds for $b = a+1$. Let us assume that (2.13) holds for $b = a+n$ (in which $n$ is such that $a+n+1 < k$), 
i.e., 

\[
v(k-a,T) - v(k-a-n,T) \le n\,[\, v(k-a,T) - v(k-a-1,T) \,].
\]

On the other hand, 

\[
v(k-a-n,T) - v(k-a-n-1,T) \le v(k-a,T) - v(k-a-1,T),
\]

thanks to Lemma 2.7. Adding the last two inequalities, we obtain (2.13) for $b = a+n+1$. □ 
The above corollary in particular implies that there are at most two minimizers in (2.2). Indeed, if 
$a_1 = a(k,T) < a_2 = a_2(k,T)$ are both minimizers in (2.2), i.e. 

\[
v(k-a_1,T) + F(a_1) = v(k-a_2,T) + F(a_2), \tag{2.14}
\]

then with $\alpha = \frac{1}{a_2 - a_1} < 1$, we obtain 

\[
\begin{aligned}
v(k-a_1-1,T) + F(a_1+1) &< \alpha\, v(k-a_2,T) + (1-\alpha)\, v(k-a_1,T) + \alpha F(a_2) + (1-\alpha) F(a_1) \\
&\le v(k-a_1,T) + F(a_1),
\end{aligned}
\]

where the last line used (2.14). This is a contradiction as $a(k,T) = a_1$ is the smallest minimizer of 
(2.2). 

Lemma 2.9. Define 

\[
G(k,T) = v(k,T) - \min_{a \in \{0,1,\ldots,k\}} [\, v(k-a,T) + F(a) \,]. \tag{2.15}
\]

The map $k \mapsto G(k,T)$ is non-decreasing for all $T \in \mathbb{R}_+$. 

Proof. We will show that $G(k,T) \ge G(\ell,T)$ for $k \ge \ell$. Since $a(\ell,T) \le a(k,T)$, 

\[
\begin{aligned}
G(k,T) &\ge v(k,T) - \big( v(k - a(\ell,T),T) + F(a(\ell,T)) \big) \\
&\ge v(\ell,T) - \big( v(\ell - a(\ell,T),T) + F(a(\ell,T)) \big) = G(\ell,T),
\end{aligned}
\]

in which the second inequality follows from Lemma 2.7. □ 

The quantity G{k, T) in Lemma 2.9 represents the maximal gain from an immediate impending trade. 
Lemma 2.9 has the interpretation that the more units the agent still has, the more eager she is to sell 
them and so the benefit of a matching order is larger. The next lemma shows that G is also related to 
the time-derivative of v. 

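Lemma 2.10. The derivative of $v$ with respect to time-to-maturity is 

\[
\partial_T v(k,T) = -\lambda\, G(k,T). \tag{2.16}
\]

Proof. For $h > 0$, let $A = \{\sigma_1 > h\}$, $B = \{\sigma_1 \le h,\ \sigma_2 > h\}$ and $C = (A \cup B)^c$. We have that 
$\mathbb{P}(A) = e^{-\lambda h}$, $\mathbb{P}(B) = \lambda h e^{-\lambda h}$ and $\mathbb{P}(C) = o(h)$. Using the dynamic programming principle, we can 
write 

\[
v(k,T+h) = \mathbb{E}\big[\, v(k,T)\, 1_A + (v(k,T) - G(k,T))\, 1_B + X\, 1_C \,\big],
\]

in which $X$ is a bounded random variable. Then sending $h \to 0$ we obtain 

\[
\begin{aligned}
\lim_{h \to 0} \frac{v(k,T+h) - v(k,T)}{h} &= \lim_{h \to 0} \frac{\mathbb{E}[\, v(k,T)\, 1_{A \cup B} - G(k,T)\, 1_B \,] - v(k,T) + o(h)}{h} \\
&= \lim_{h \to 0} \frac{-\lambda h e^{-\lambda h}\, G(k,T) + o(h)}{h} = -\lambda\, G(k,T).
\end{aligned}
\]

□ 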
Using Lemma 2.10 we may complete our description of the properties of $a(k,T)$. First, the next 
lemma shows that the optimal trading amount decreases as the horizon becomes longer. 

Lemma 2.11. For any $S > T$, $a(k,S) \le a(k,T)$, $\forall k \in \mathbb{N}_+$. 

Proof. For any $b > a(k,T)$ 

\[
\begin{aligned}
& v(k-b,T) + F(b) \ge v(k - a(k,T),T) + F(a(k,T)) \\
\Rightarrow\ & v(k - a(k,T),T) - v(k-b,T) \le F(b) - F(a(k,T)).
\end{aligned}
\]

We have that $\partial_T v(k,T) \le \partial_T v(i,T)$ for $i \le k$, due to Lemmas 2.9 and 2.10. Therefore, 

\[
v(k - a(k,T),S) - v(k-b,S) \le v(k - a(k,T),T) - v(k-b,T) \le F(b) - F(a(k,T)),
\]

which implies that $a(k,T)$ performs at least as well as action $b$ for the minimization problem 
$\min_{a \in \{0,\ldots,k\}} \{ v(k-a,S) + F(a) \}$, which in turn implies that $b \ne a(k,S)$, the smallest minimizer for this problem. Since 
this holds for any $b > a(k,T)$ we necessarily have that $a(k,T) \ge a(k,S)$. □ 

In the next lemma we shall see that $T \mapsto a(k,T)$ decreases to 1. 

Lemma 2.12. $\lim_{T \to \infty} v(k,T) = kF(1)$ and $\lim_{T \to \infty} a(k,T) = 1$. We also have that $a(k,0) = \lfloor k/2 \rfloor$. 

Proof. Recall from Lemma 2.2 that $v(k,T) \le v^{1}(k,T)$, where $v^{1}$ denotes the performance of a constant 
1-strategy that always sells a single unit. Since 

\[
v^{1}(k,T) = kF(1)\, \mathbb{P}(N(T) \ge k) + \sum_{n=0}^{k-1} \big( nF(1) + F(k-n) \big)\, \mathbb{P}(N(T) = n) \to kF(1) \quad \text{as } T \to \infty,
\]

while $v(k,T) \ge kF(1)$ $\forall T$, the first statement of the lemma follows. 

Let us choose a positive $\delta < F(2) - 2F(1)$. Fix $k > 0$; by the above, for large enough $\bar{T}$, we have 
that $v(a,T) \le aF(1) + \delta$ for all $a \in \{1,\ldots,k\}$ and $T > \bar{T}$. Then by convexity of $F$ 

\[
v(k-1,T) + F(1) \le (k-1)F(1) + \delta + F(1) < (k-c)F(1) + F(c) \le v(k-c,T) + F(c),
\]

for any $T > \bar{T}$ and any $1 < c \le k$. Comparing with the definition of $a(k,T)$ in (2.2), we conclude that 
$a(k,T) = 1$ for $T > \bar{T}$. □ 

Corollary 2.13. There exist distinct thresholds $T^{(k,i)}$ such that $a(k,T) = i$ when 

\[
T^{(k,i+1)} \le T < T^{(k,i)}. \tag{2.17}
\]

Proof. The basic idea of the corollary follows from Lemma 2.11. It remains to show that the thresholds 
are distinct, i.e. $T^{(k,i)} < T^{(k,i-1)}$, so that as a function of $T$, $a(k,T)$ experiences jumps of size 1 only. 

Toward a contradiction, suppose that there exists $T$ and level $k$ such that $a(k,T-) - a(k,T) > 1$. 
Let $a = a(k,T)$ and $b = a(k,T-) > a+1$. Since $1 \le a(k,\cdot) \le \lfloor k/2 \rfloor$ is non-increasing and has at most 
$\lfloor k/2 \rfloor - 1$ jumps, there exists $\delta > 0$ such that $b = a(k,T-s)$ for all $0 < s < \delta$. By optimality of $b$ we have 
that 

\[
v(k-b,T-s) + F(b) = v(k - a(k,T-s),T-s) + F(a(k,T-s)) \le v(k-a,T-s) + F(a) \qquad \forall s < \delta.
\]

Therefore, by continuity of the value function in $T$, and optimality of $a$ at $T$ we must have 

\[
v(k-a,T) + F(a) = v(k-b,T) + F(b). \tag{2.18}
\]

Let $\alpha = 1/(b-a) \in (0,1)$. By the strict convexity of $F$ we have that $F(a+1) < \alpha F(b) + (1-\alpha) F(a)$. 
Similarly, by Corollary 2.8, we have that $v(k-a-1,T) \le \alpha\, v(k-b,T) + (1-\alpha)\, v(k-a,T)$. Adding 
the two latter relations together we obtain 

\[
\begin{aligned}
v(k-a-1,T) + F(a+1) &< \alpha\, ( v(k-b,T) + F(b) ) + (1-\alpha)\, ( v(k-a,T) + F(a) ) \\
&= v(k-a,T) + F(a), \quad \text{by (2.18)},
\end{aligned}
\]

which contradicts the optimality of $a$. □ 

As a corollary of Lemma 2.10 and Corollary 2.13 we have the following result. 

Corollary 2.14. The function $T \mapsto v(k,T)$ is decreasing and convex. The second derivative of $v$ with 
respect to $T$ is continuous except at $T \in \{ T^{(k,i)} : i = 1,\ldots,\lfloor k/2 \rfloor \}$ (see (2.17)). 

Proof. We already know that $v(k,\cdot)$ is decreasing from Lemma 2.3. For any $T \notin \{ T^{(k,i)} \}$ we have from 
combining (2.15) and (2.16) that 

\[
\partial_T G(k,T) = -\lambda\, \big( G(k,T) - G(k - a(k,T),T) \big), \tag{2.19}
\]

since $a(k,T)$ is constant in a neighborhood of $T$ thanks to Corollary 2.13. When $T = T^{(k,i)}$, the right 
derivative of $G$ is still equal to (2.19) since $T \mapsto a(k,T)$ is right continuous. But the left derivative is 
equal to $\partial_T G(k,T-) = -\lambda\, \big( G(k,T) - G(k - a(k,T) - 1,T) \big)$. Recalling Lemma 2.10 we see that the 
second derivative of $v$ with respect to $T$ has a discontinuity at $T = T^{(k,i)}$. 

On the other hand, by Lemma 2.9, derivatives of $G$ with respect to $T$ are negative and thus the 
second derivative of $v$ is positive. □ 

Another corollary of Lemma 2.11 is the effect of the arrival intensity $\lambda$ of $N$. 

Corollary 2.15. The value function $v(k,T)$ and optimal action $a(k,T)$ are decreasing in $\lambda$. 

Note that we have the scaling property $v(k,T;\lambda) = v(k,\alpha T;\lambda/\alpha)$ for any $\alpha > 0$ since the main 
parameter is intensity of arrivals per effective horizon. Thus, dependence of $v$ (and $a$) on $\lambda$ is equivalent 
to its inverse dependence on time horizon. Below we give a second proof using the concept of coupling. 
This approach will be re-used later in Section 3.3. 

Proof. Consider two Poisson processes $N_1$, $N_2$ with intensities $\lambda_1 > \lambda_2$. Then one may construct a 
probability space $(\Omega', \mathcal{F}', \mathbb{P}')$ and random variables $\tau^{(i)}_k$, $i = 1,2$, $k = 1,2,\ldots$ such that $\tau^{(i)}_k \sim \mathrm{Exp}(\lambda_i)$ 
and $\tau^{(1)}_k \le \tau^{(2)}_k$ $\mathbb{P}'$-almost surely. Letting $N_i'(t) = \max( k : \sum_{j=1}^{k} \tau^{(i)}_j \le t )$ we obtain two coupled copies 
$N_1', N_2'$ of $N_1, N_2$, such that $\mathbb{P}'( N_1'(t) \ge N_2'(t)\ \forall t ) = 1$. Now it is fairly obvious that $v^{\lambda_1}(k,T) \le v^{\lambda_2}(k,T)$ 
since working under $\mathbb{P}'$, the first case has almost surely more arrivals than the second case. Formally, 
let us define a deterministic time-change by $\tau(t) = (\lambda_1/\lambda_2)\, t$. Then $\mathbb{P}'( \tau^{(1)}_k \in dt ) = \mathbb{P}'( \tau^{(2)}_k \in \tau(dt) )$, 
which implies $\mathbb{P}'( N_1'(t) \le j ) = \mathbb{P}'( N_2'(\tau(t)) \le j )$ for all $j$ and therefore $v^{\lambda_1}(k,T) = v^{\lambda_2}(k,\tau(T))$ (map 
any control for $v^{\lambda_1}(k,T)$ into the correspondingly time-changed control for $v^{\lambda_2}(k,\tau(T))$). Now, since $\tau(T) > T$ it follows that 
$v^{\lambda_1}(k,T) = v^{\lambda_2}(k,\tau(T)) \le v^{\lambda_2}(k,T)$, because $T \mapsto v(k,T)$ is decreasing by Lemma 2.3. □ 
The following Proposition is the main result of this section and summarizes all the above analysis. 
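Proposition 2.1. Consider the problem 

\[
v(k,T) = \inf_{\xi \in \mathcal{A}_k} \mathbb{E}\Big[ \sum_{i : \sigma_i \le T} F(\xi_{\sigma_i-} - \xi_{\sigma_i}) + F(\xi_T) \Big].
\]

Then the optimal strategy is given by (2.2) and: 

(i) $k \mapsto v(k,T)$ is non-decreasing, "convex", and $v(k+1,T) - v(k,T) \le F(k+1) - F(k)$ for all 
$k, T$. 

(ii) $T \mapsto v(k,T)$ is decreasing and convex. Moreover, $T \mapsto \partial_T^2 v(k,T)$ is discontinuous only at 
$T = T^{(k,i)}$ (see (2.17)). 

(iii) $\partial_T v(k,T) = -\lambda\, \big( v(k,T) - \min_{a \in \{0,1,\ldots,k\}} [\, v(k-a,T) + F(a) \,] \big) \le 0$. Moreover $\partial_T v(k,T)$ is 
increasing in $k$. 

(iv) $k \mapsto a(k,T)$ is non-decreasing and increases by jumps of size 1 only. 

(v) $T \mapsto a(k,T)$ is non-increasing and right continuous with $a(k,0) = \lfloor k/2 \rfloor$ and $\lim_{T \to \infty} a(k,T) = 
1$. Moreover, its jumps are of size 1. The jumps occur at the discontinuity points of $T \mapsto \partial_T^2 v(k,T)$. 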

Remark 2.1. A word on the computation of the value function and the optimal action. 

Using the above results, one may easily compute $v(k,T)$ for any depth function $F(\cdot)$ by using the 
coupled family of first-order ordinary differential equations (2.16) over a time grid. Note that given 
$v(k,T)$, $a(k,T)$, finding the minimum in the definition of $G(k,T+h)$ requires just one comparison 
since $a(k,T+h) \in \{ a(k,T),\ a(k,T) - 1 \}$. Given $v(k,T)$ and $a(k,T)$, an optimal trading strategy is 
straightforwardly implemented using (2.5). 
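The computation described in this remark can be sketched with an explicit Euler scheme. This is an illustrative implementation under stated assumptions, not the authors' algorithm: it integrates the ODE system (2.16) forward in time-to-maturity from the terminal condition $v(k,0) = F(k)$, $v(0,\cdot) = 0$, and for simplicity recomputes the full minimum at every step rather than using the one-comparison shortcut. The function name, the step count, and the assumption $F(0) = 0$ are choices made here for the example.

```python
def solve_value(K, T, lam, F, steps=2000):
    """Explicit Euler integration of dv/dT = -lam * G(k, T) for k = 1..K.

    Returns (v, a) where v[k] approximates v(k, T) and a[k] is the smallest
    minimizer in (2.2) evaluated on the approximate values.
    """
    h = T / steps
    # Terminal condition: with no time left, all k units go as one block.
    v = [0.0] + [float(F(k)) for k in range(1, K + 1)]
    for _ in range(steps):
        new_v = v[:]
        for k in range(1, K + 1):
            best = min(v[k - a] + F(a) for a in range(1, k + 1))
            # G(k,T) in (2.15) also allows a = 0 (no trade), so it is >= 0.
            gain = max(v[k] - best, 0.0)
            new_v[k] = v[k] - h * lam * gain
        v = new_v
    a_opt = [0] * (K + 1)
    for k in range(1, K + 1):
        costs = [v[k - a] + F(a) for a in range(1, k + 1)]
        # list.index returns the first minimum, i.e. the smallest minimizer;
        # near a threshold, floating-point ties may be resolved either way.
        a_opt[k] = 1 + costs.index(min(costs))
    return v, a_opt
```

With `F = lambda a: float(a * a)` and `lam = 1.0`, the output for $k = 2, 3$ can be compared against the closed forms of Section 2.1. The Euler error is of order `T / steps`, so a finer grid or a higher-order scheme would be preferable when accuracy matters.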

3. Extensions 

Using the analysis of Section 2 as a starting point, we now consider several progressively more 
sophisticated versions of the original model (1.1) so as to better express the complexities of real markets. 

3.1. Constrained Trading. In this section we consider the modified model whereby $N$ is a compound 
Poisson process with characteristics $(\lambda, \nu)$ and the agent is constrained to trade only up to the order 
size $Y_i$. To summarize, we look at the constrained value function 

\[
v(k,T) = \inf_{\xi \in \mathcal{A}_k} \mathbb{E}\Big[ \sum_{i : \sigma_i \le T} F(\xi_{\sigma_i-} - \xi_{\sigma_i}) + F(\xi_T) \Big], \qquad k \in \mathbb{N}_+,\ T \in \mathbb{R}_+,
\]

where $\mathcal{A}_k$ is the set of all $\mathbb{F}$-adapted, integer-valued, positive and decreasing processes whose values 
change only at the time of jumps of the Poisson process $N$ in such a way that $0 \le \xi_{\sigma_i-} - \xi_{\sigma_i} \le Y_i$. Thus, 
the model now also includes the distribution of order sizes. As a first remark, note that the constrained 
value function is trivially bounded below by the unconstrained value function of (1.1). 
In counterpart to the dynamic programming equation (2.1), the constrained value function $\tilde v$ is the
unique fixed point of the following functional operator $L$:

\[ L v(k,T) = \mathbb{E}\Big[ F(k) 1_{\{\sigma_1 > T\}} + \min_{a \in \{1,2,\ldots,Y_1 \wedge k\}} \big( F(a) + v(k-a, T-\sigma_1) \big) 1_{\{\sigma_1 \le T\}} \Big]. \tag{3.1} \]

The proof of (3.1), as well as of the fact that $L$ has a unique fixed point, is identical to that of
Proposition 3.2 below and is therefore deferred. Let us now define

\[ \tilde a(k,T) = \arg\min_{a \in \{1,\ldots,k\}} \{ \tilde v(k-a,T) + F(a) \}. \tag{3.2} \]

The next proposition is analogous to Proposition 2.1. However, it is complicated by the fact that, without
proving the convexity results regarding $\tilde v(\cdot,T)$, it is not clear that

\[ \min_{a \in \{1,2,\ldots,Y_1 \wedge k\}} \big( F(a) + \tilde v(k-a, T-\sigma_1) \big) = F(\tilde a(k,T) \wedge Y_1) + \tilde v\big( k - (\tilde a(k,T) \wedge Y_1), T-\sigma_1 \big). \]

The above statement implies that an optimal liquidation strategy consists of placing trades of size
$\tilde a(k,T)$ and then letting them be filled to the maximum extent by the matching incoming orders.

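Proposition 3.1. The following hold:

(i) $k \mapsto \tilde v(k,T)$ is non-decreasing, "convex", and $\tilde v(k+1,T) - \tilde v(k,T) \le F(k+1) - F(k)$ for all
$k, T$.

(ii) $T \mapsto \tilde v(k,T)$ is decreasing and convex.

(iii) Denote $\nu[\tilde a(k,T), \infty) = \mathbb{P}(Y_1 \ge \tilde a(k,T))$. Then

\[ \partial_T \tilde v(k,T) = -\lambda \Big( \tilde v(k,T) - \big[ \tilde v(k - \tilde a(k,T), T) + F(\tilde a(k,T)) \big] \nu[\tilde a(k,T), \infty) - \sum_{y=1}^{\tilde a(k,T)-1} \nu(y) \big[ \tilde v(k-y,T) + F(y) \big] \Big) \le 0; \tag{3.3} \]

moreover $\partial_T \tilde v(k,T)$ is increasing in $k$.

(iv) $k \mapsto \tilde a(k,T)$ is non-decreasing and increases by jumps of size 1 only.

(v) $T \mapsto \tilde a(k,T)$ is non-increasing and right continuous with $\tilde a(k,0) = \lfloor k/2 \rfloor$ and $\lim_{T \to \infty} \tilde a(k,T) = 1$.
Moreover, its jumps are of size 1. The jumps of $T \mapsto \tilde a(k,T)$ occur at the discontinuity points
of $T \mapsto \partial_T \tilde v(k,T)$.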

Proof. We will first consider an auxiliary control problem in which the agent has to submit her sell
orders before seeing the size of the incoming buy orders. (This parallels real markets where, once an
order is placed, it will be maximally partially filled against any incoming matching order.) Let us
denote the corresponding value function by $V(k,T)$. Again, a dynamic programming principle implies
that this value function is the unique fixed point of an operator $\mathcal{L}$ that is defined by

\[ \mathcal{L} V(k,T) = \mathbb{E}\Big[ F(k) 1_{\{\sigma_1 > T\}} + \big( F(\alpha(k,T) \wedge Y_1) + V(k - (\alpha(k,T) \wedge Y_1), T-\sigma_1) \big) 1_{\{\sigma_1 \le T\}} \Big], \]

in which

\[ \alpha(k,T) = \arg\min_{a \in \{1,\ldots,k\}} \big( V(k-a,T) + F(a) \big). \]

The proofs in Section 2 now go through to show that the pair $(V, \alpha)$ satisfies (i)-(v) of Proposition 3.1.
Now since $V(\cdot,T)$ is convex, it follows that $a \mapsto V(k-a,T) + F(a)$ is monotone on the set $\{a \le \alpha(k,T)\}$
and therefore the action of $\mathcal{L}$ and of $L$ from (3.1) on $V$ is the same. Since $V$ is a fixed point of $\mathcal{L}$,

\[ V = \mathcal{L} V = L V. \]

But $L$ has a unique fixed point, so that $\tilde v(\cdot,\cdot) = V(\cdot,\cdot)$ and $\tilde a(\cdot,\cdot) = \alpha(\cdot,\cdot)$, and the proof is complete. □

3.2. Regime Switching Setting. The model in Section 2 assumed a constant level of trade activity
over the full time horizon. However, as practitioners know, real-life order flows experience multiple
regime changes. For instance, a common intra-day pattern features a high level of activity at the beginning
and end of the trading session and a lower trade intensity during midday. Alternatively, markets may
experience liquidity crises, whereby order flow abruptly slows down. To capture such stylized features,
in this section we assume that $N$ is a regime-switching compound Poisson process, modulated by the
market state variable $M$. $M$ represents the market liquidity; namely, the order frequency and order
sizes in the order flow are driven by $M$.

Formally, let $N^{(1)}, \ldots, N^{(m)}$ be $m$ independent compound Poisson processes with intensities and jump
distributions $(\lambda_1, \nu_1), \ldots, (\lambda_m, \nu_m)$. We assume that $M$ forms an independent finite-state Markov chain
with state space $E = \{1,2,\ldots,m\}$ and infinitesimal generator $Q = (q_{ij})$. Then the observed order flow
is given by

\[ N_t = \int_0^t \sum_{i \in E} 1_{\{M_s = i\}} \, dN^{(i)}_s, \qquad t \ge 0. \tag{3.4} \]

By construction, the increments of $N$ are independent conditioned on $M$. Let $v(k,T;i)$ represent the
minimal execution costs conditional on $M_0 = i$. Note that the lower and upper bounds derived in
Lemmas 2.1 and 2.2 also bound the value function in the regime-switching case. The Hamilton-Jacobi-Bellman
equation for the value function is given by the following lemma; compare also with Lemma 2.10.

Lemma 3.1. Let us denote

\[ G(k,T;i) := v(k,T;i) - \min_{a} \big[ v(k-a,T;i) + F(a) \big]. \]

Then the derivative of $v$ with respect to its second variable is

\[ \partial_T v(k,T;i) = -\lambda_i G(k,T;i) + \sum_{j \in E \setminus \{i\}} q_{ij} \big( v(k,T;j) - v(k,T;i) \big). \tag{3.5} \]

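Proof. Denote by $\tau_k$ the $k$-th transition time of $M$. For $h > 0$, let $A = \{\sigma_1 > h, \tau_1 > h\}$,
$B_N = \{\sigma_1 \le h, \sigma_2 > h, \tau_1 > h\}$, $B_j = \{\sigma_1 > h, \tau_1 \le h, \tau_2 > h, M_{\tau_1} = j\}$, $j \in E \setminus \{i\}$, and
$C = (A \cup B_N \cup \bigcup_j B_j)^c$. By conditional independence of $N$ and $M$ we have that
$\mathbb{P}^i(A) = \mathbb{P}(A \mid M_0 = i) = e^{-(\lambda_i - q_{ii})h}$, $\mathbb{P}^i(B_N) = \lambda_i h \, e^{-(\lambda_i - q_{ii})h}$, $\mathbb{P}^i(B_j) = q_{ij} h + o(h)$ and
$\mathbb{P}^i(C) = o(h)$. Using the dynamic programming principle, we can write

\[ v(k,T+h;i) = \mathbb{E}^i\Big[ v(k,T;i) 1_A + \big( v(k,T;i) - G(k,T;i) \big) 1_{B_N} + \sum_{j \in E \setminus \{i\}} v(k,T;j) 1_{B_j} + X 1_C \Big], \]

in which $X$ is a bounded random variable. Taking the limit $h \downarrow 0$ we obtain

\begin{align*}
\lim_{h \downarrow 0} \frac{v(k,T+h;i) - v(k,T;i)}{h}
&= \lim_{h \downarrow 0} \frac{1}{h} \Big( \mathbb{E}^i\Big[ v(k,T;i) 1_{A \cup B_N \cup \bigcup_j B_j} - G(k,T;i) 1_{B_N} + \sum_{j \in E \setminus \{i\}} \big( v(k,T;j) - v(k,T;i) \big) 1_{B_j} \Big] - v(k,T;i) + o(h) \Big) \\
&= \lim_{h \downarrow 0} \frac{ -\lambda_i h \, G(k,T;i) + \sum_{j \in E \setminus \{i\}} q_{ij} h \big( v(k,T;j) - v(k,T;i) \big) + o(h) }{h} \\
&= -\lambda_i G(k,T;i) + \sum_{j \in E \setminus \{i\}} q_{ij} \big( v(k,T;j) - v(k,T;i) \big).
\end{align*}
□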
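
The system (3.5) can be integrated on a time grid in the same spirit as Remark 2.1. The sketch below is illustrative only and rests on several assumptions: an explicit Euler step, the unconstrained minimum over $a \in \{0,1,\ldots,k\}$ (so that $G \ge 0$ by construction), the depth function $F(a) = a^2/2$, and a hypothetical two-state generator in the usage example. The function name `solve_regime_switching` is not from the paper.

```python
def solve_regime_switching(k_max, T_max, lam, Q, dt=1e-3, F=lambda a: 0.5 * a * a):
    """Explicit Euler scheme (an assumption of this sketch) for the ODE system
        d/dT v(k,T;i) = -lam[i] * G(k,T;i) + sum_{j != i} Q[i][j] * (v(k,T;j) - v(k,T;i)),
    where G(k,T;i) = v(k,T;i) - min_{0 <= a <= k} [v(k-a,T;i) + F(a)] and v(k,0;i) = F(k).
    Returns the nested list v[i][k] evaluated at T = T_max."""
    m = len(lam)
    v = [[F(k) for k in range(k_max + 1)] for _ in range(m)]
    for _ in range(int(round(T_max / dt))):
        new_v = [row[:] for row in v]
        for i in range(m):
            for k in range(1, k_max + 1):
                best = min(v[i][k - a] + F(a) for a in range(0, k + 1))
                G = v[i][k] - best  # non-negative because a = 0 is allowed
                switch = sum(Q[i][j] * (v[j][k] - v[i][k]) for j in range(m) if j != i)
                new_v[i][k] = v[i][k] + dt * (-lam[i] * G + switch)
        v = new_v
    return v


if __name__ == "__main__":
    # Hypothetical example: a liquid regime (rate 3) and an illiquid regime (rate 1).
    v = solve_regime_switching(10, 1.0, lam=[3.0, 1.0], Q=[[-1.0, 1.0], [1.0, -1.0]])
    print("v(10,1;1) =", v[0][10], " v(10,1;2) =", v[1][10])
```

With $Q = 0$ the regimes decouple and each row reduces to the base model of Section 2, which gives a simple consistency check.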

3.3. Partially Observed Setting. We continue to work with the model of the previous section but 
now also assume that the market liquidity variable M is not observed. This is a good proxy for real 
markets where market participants do not know the full liquidity state. Instead, agents infer current 
liquidity based on observed trades. Thus, decreased frequency of trades may point to an impending 
liquidity crisis and therefore force agents to place larger trades to avoid being stuck with an illiquid 
position. 

We shall postulate a Bayesian setting whereby the agent dynamically updates her beliefs about $M$.
Let $D = \{\pi \in [0,1]^m : \pi_1 + \ldots + \pi_m = 1\}$ be the space of prior distributions of the Markov process $M$.
Let

\[ \mathbb{P}^{\pi}\{A\} = \pi_1 \mathbb{P}\{A \mid M_0 = 1\} + \ldots + \pi_m \mathbb{P}\{A \mid M_0 = m\} \tag{3.6} \]

for any measurable set $A$. We define the $D$-valued conditional probability process $\Pi(t) = (\Pi_1(t), \ldots, \Pi_m(t))$
such that

\[ \Pi_i(t) = \mathbb{P}^{\pi}\{M_t = i \mid \mathcal{F}^N_t\}, \qquad \text{for } i \in E \text{ and } t \ge 0. \tag{3.7} \]

Each component of $\Pi$ gives the conditional probability that the current state of $M$ is $i$ given the
information generated by $N$ until the current time $t$.

The partially-observed execution problem can now be stated as

\[ v(k,T,\pi) = \inf_{\xi \in \mathcal{A}^N_k} \mathbb{E}^{\pi}\Big[ \sum_{i:\sigma_i \le T} F(\xi_{\sigma_i-} - \xi_{\sigma_i}) + F(\xi_T) \Big], \tag{3.8} \]

where the minimization is over all $\mathbb{F}^N$-adapted admissible controls $\xi$ with $\xi_0 = k$. We denote this
restricted set of admissible strategies by $\mathcal{A}^N_k$.
With the partially observed setup, the dynamic programming principle for $v(k,T,\pi)$ is no longer trivial.
The following proposition establishes such a result using the methods of Bayraktar and Ludkovski
[2008, 2009].

Proposition 3.2. The value function $v$ satisfies the dynamic programming equation $v = Lv$, in which
$L$ is the first jump operator given by

\[ L v(k,T,\pi) = \mathbb{E}^{\pi}\Big[ F(k) 1_{\{\sigma_1 > T\}} + \min_{a \in \{1,2,\ldots,Y_1 \wedge k\}} \big( F(a) + v(k-a, T-\sigma_1, \Pi_{\sigma_1}) \big) 1_{\{\sigma_1 \le T\}} \Big]. \tag{3.9} \]

In fact, $v$ is the unique fixed point of $L$.



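Before giving the proof of Proposition 3.2, it is necessary to first understand the behavior of the
conditional probability process $\Pi$. The sample paths of $\Pi$ are obtained as in Bayraktar and Ludkovski
[2008]. We briefly summarize the developed theory. First, let

\[ I(t) := \int_0^t \sum_{i=1}^m \lambda_i 1_{\{M_s = i\}} \, ds. \tag{3.10} \]

By inspection, the expected value of $\exp(-I(t))$ gives the probability of no events for the next $t$ time
units, namely $\mathbb{P}^{\pi}\{\sigma_1 > t\} = \mathbb{E}^{\pi}[e^{-I(t)}]$. The latter expression is found to be [Neuts, 1989, Theorem
5.3.2] equal to $\mathbb{E}^{\pi}[e^{-I(t)}] = \sum_i m_i(t,\pi)$, where

\[ m(t,\pi) = (m_1(t,\pi), \ldots, m_m(t,\pi)), \qquad m_i(t,\pi) := \mathbb{E}^{\pi}\big[ 1_{\{M_t = i\}} \cdot e^{-I(t)} \big], \tag{3.11} \]

has the form

\[ m(t,\pi) = \pi \cdot e^{t(Q - \Lambda)}, \]

where $\Lambda$ is the $m \times m$ diagonal matrix with $\Lambda_{ii} = \lambda_i$. It also follows that

\[ \mathbb{P}^{\pi}\{\sigma_1 \in du, M_u = i\} = \mathbb{E}^{\pi}\big[ \lambda_i 1_{\{M_u = i\}} e^{-I(u)} \big] du = \lambda_i \, m_i(u,\pi) \, du. \]

Consequently, conditional on no arrivals observed on $[t, t+u]$ we obtain using Bayes' rule

\[ \Pi_i(t+u) = \frac{\mathbb{P}^{\pi}\{\sigma_1 > u, M_u = i\}}{\mathbb{P}^{\pi}\{\sigma_1 > u\}} \bigg|_{\pi = \Pi(t)} = x_i(u, \Pi(t)), \qquad \text{where } x_i(t,\pi) := \frac{m_i(t,\pi)}{\sum_{j \in E} m_j(t,\pi)}. \tag{3.12} \]

On the other hand, upon an arrival of order size $Y_\ell$, the conditional probability $\Pi$ experiences a jump

\[ \Pi_i(\sigma_\ell) = \frac{\lambda_i \nu_i(Y_\ell) \Pi_i(\sigma_\ell-)}{\sum_{j \in E} \lambda_j \nu_j(Y_\ell) \Pi_j(\sigma_\ell-)}, \qquad \text{for } \ell \in \mathbb{N}. \tag{3.13} \]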
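
The two belief updates, the deterministic flow between arrivals and the Bayes jump at an arrival, can be sketched as follows. This is a minimal pure-Python illustration, not the authors' code: the matrix exponential is approximated by a truncated Taylor series (adequate only for moderate values of $t$ times the rates), the order-size law is the zero-truncated Poisson distribution used later in Section 4, and all function names are hypothetical.

```python
import math


def m_flow(t, pi, Q, lam, terms=60):
    """Row vector m(t, pi) = pi * exp(t (Q - Lambda)), via a truncated Taylor series."""
    n = len(pi)
    A = [[Q[i][j] - (lam[i] if i == j else 0.0) for j in range(n)] for i in range(n)]
    term, total = list(pi), list(pi)
    for r in range(1, terms + 1):
        # term becomes pi * (tA)^r / r!
        term = [sum(term[i] * A[i][j] for i in range(n)) * t / r for j in range(n)]
        total = [a + b for a, b in zip(total, term)]
    return total


def x_flow(t, pi, Q, lam):
    """Beliefs after t time units without arrivals: m(t, pi) normalized to sum to one."""
    m = m_flow(t, pi, Q, lam)
    s = sum(m)
    return [c / s for c in m]


def jump_update(pi, y, lam, nu):
    """Bayes update of the beliefs at an arrival of size y; nu[i] is the size pmf in regime i."""
    w = [lam[i] * nu[i](y) * pi[i] for i in range(len(pi))]
    s = sum(w)
    return [c / s for c in w]


def zero_truncated_poisson(mu):
    """pmf y -> e^{-mu} mu^y / (y! (1 - e^{-mu})) on y = 1, 2, ..."""
    return lambda y: math.exp(-mu) * mu ** y / (math.factorial(y) * (1.0 - math.exp(-mu)))


if __name__ == "__main__":
    Q = [[-1.0, 1.0], [1.0, -1.0]]  # hypothetical generator
    lam = [3.0, 1.0]
    nu = [zero_truncated_poisson(8.0), zero_truncated_poisson(4.0)]
    pi = x_flow(0.5, [0.5, 0.5], Q, lam)   # half a time unit with no arrivals
    print("after silence:", pi)
    print("after an order of size 9:", jump_update(pi, 9, lam, nu))
```

A period without arrivals shifts mass toward the low-intensity regime, while a large order shifts it toward the regime with the larger mean order size.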



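Using the above developments, we are led to define for $i \in E$ the best action operator

\[ S_i w(k,T,\pi) = \sum_{y=1}^{\infty} \min_{a \le \min(k,y)} \Big\{ w\Big( k-a, T, \Big( \frac{\lambda_1 \nu_1(y) \pi_1}{\sum_{j \in E} \lambda_j \nu_j(y) \pi_j}, \ldots, \frac{\lambda_m \nu_m(y) \pi_m}{\sum_{j \in E} \lambda_j \nu_j(y) \pi_j} \Big) \Big) + F(a) \Big\} \nu_i(y). \tag{3.14} \]

Combining (3.12), (3.13) and (3.14) we see that the action of the operator $L$ can be expressed as follows.

Corollary 3.2. We have

\[ \mathbb{E}^{\pi}\Big[ F(k) 1_{\{\sigma_1 > T\}} + \min_{a \in \{1,2,\ldots,Y_1 \wedge k\}} \big( F(a) + v(k-a, T-\sigma_1, \Pi_{\sigma_1}) \big) 1_{\{\sigma_1 \le T\}} \Big] = \sum_{i \in E} \Big[ m_i(T,\pi) \cdot F(k) + \int_0^T m_i(u,\pi) \cdot \lambda_i \cdot S_i v(k, T-u, x(u,\pi)) \, du \Big]. \]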



We now return to the proof of Proposition 3.2.
Proof. Let us introduce

\[ u_0(k,T,\pi) = F(k), \qquad u_n(k,T,\pi) = L u_{n-1}(k,T,\pi), \quad n \ge 1. \tag{3.16} \]

Following the logic of the proof of Proposition 3.1 in Bayraktar and Ludkovski [2008], we can show that

\[ u_n(k,T,\pi) = v_n(k,T,\pi) = \inf_{\xi \in \mathcal{A}^N_k} \mathbb{E}^{\pi}\Big[ \sum_{i \le n;\ \sigma_i \le T} F(\xi_{\sigma_i-} - \xi_{\sigma_i}) + F(\xi_T) \Big], \tag{3.17} \]

which denotes the value function under the constraint that the agent only trades during the first $n$
orders (and makes zero-trades thereafter until the close $T$). On the other hand $v_n(k,T,\pi) = v(k,T,\pi)$
for $n \ge k$, since at most $k$ trades are needed to liquidate a position of size $k$. Now, thanks to (3.16),

\[ v(k,T,\pi) = v_{k+1}(k,T,\pi) = L v_k(k,T,\pi) = L v(k,T,\pi). \]

The fact that $v$ is the unique fixed point of $L$, which is an increasing, continuous and concave operator
(cf. Corollary 3.2), follows from standard results in optimal control; see e.g. Zabczyk [1983] or the proof
of Theorem 3.1 in Bayraktar and Ludkovski [2009]. □

In the special case where there are only two liquidity regimes, $E = \{1,2\}$, and identical order size
distributions $\nu_1 = \nu_2$, we may obtain an important monotonicity property of the value function.

Lemma 3.3. Suppose that $E = \{1,2\}$ and $\lambda_1 > \lambda_2$. Then $\pi \mapsto v(k,T,(\pi, 1-\pi))$ is a monotone
non-increasing function, that is, $v(k,T,\pi') \ge v(k,T,\pi)$ whenever $\pi > \pi'$.

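Proof. With two regimes, we identify the vector $(\pi, 1-\pi)$ with the scalar $\pi$ and subsequently write
$v(k,T,\pi) = v(k,T,(\pi,1-\pi))$, $\mathbb{P}^{\pi} = \mathbb{P}^{(\pi,1-\pi)}$, etc. Observe that the conditional probability of the first
arrival time $\mathbb{E}^{\pi}[e^{-I(t)}]$ is monotone in $\pi$, and the vector flow $x_1(t,\pi)$ is decreasing in $t$ (as no observed
arrivals increase the likelihood of $M$ being in the low-liquidity state 2). Consequently, if $\pi > \pi'$, then
$F^{\pi}(t) \ge F^{\pi'}(t)$ for all $t$, where $F^{\pi}(t) = \mathbb{P}^{\pi}(\sigma_1 \le t)$ is the distribution of the first arrival time under
the respective measure. Hence, one may construct a probability measure $\tilde{\mathbb{P}}$ and two random variables
$\tilde\tau_1 \le \tilde\tau_1'$ $\tilde{\mathbb{P}}$-a.s., such that $\tilde\tau_1$ has the law of $\sigma_1$ under $\mathbb{P}^{\pi}$ and $\tilde\tau_1'$ has the law of $\sigma_1$ under $\mathbb{P}^{\pi'}$.
Moreover, since the jump operator in (3.13) preserves the ordering of $\pi$'s (as does the vector flow $x$), it follows that

\[ \Pi^{\pi'}(\tilde\tau_1') \le \Pi^{\pi'}(\tilde\tau_1) \le \Pi^{\pi}(\tilde\tau_1). \]

Now, conditional on $\tilde\tau_1, \tilde\tau_1'$, we again have $F^{\Pi^{\pi}(\tilde\tau_1)}(t) \ge F^{\Pi^{\pi'}(\tilde\tau_1')}(t)$ for all $t$ and therefore we can select
interarrival times $\tilde\tau_2 \le \tilde\tau_2'$ $\tilde{\mathbb{P}}$-a.s. with distributions $F_{\tilde\tau_2} = F^{\Pi^{\pi}(\tilde\tau_1)}$ and $F_{\tilde\tau_2'} = F^{\Pi^{\pi'}(\tilde\tau_1')}$. By the strong Markov
property, $\tilde\tau_1 + \tilde\tau_2 =: \tilde\sigma_2$ (resp. $\tilde\tau_1' + \tilde\tau_2' =: \tilde\sigma_2'$) has the same distribution as the second arrival time under $\mathbb{P}^{\pi}$
(resp. $\mathbb{P}^{\pi'}$). By induction, we construct a measure $\tilde{\mathbb{P}}$ and arrival processes $N^{\pi}$, $N^{\pi'}$ which satisfy
$\tilde{\mathbb{P}}(N^{\pi}(t) \ge N^{\pi'}(t) \ \forall t) = 1$, while the marginal distributions of $(N^{\pi}, N^{\pi'})$ are the same as those of
$((N, \mathbb{P}^{\pi}), (N, \mathbb{P}^{\pi'}))$.

We now use the above coupling argument to recursively construct a time-change $\tau(\cdot)$. For $t \le \tilde\tau_1'$
define $\tau(t) := (F^{\pi})^{-1}(F^{\pi'}(t))$. The map $\tau(\cdot)$ is well-defined since $F^{\pi}(\cdot)$ is strictly increasing for all $\pi$. Moreover,
by assumption, $\tau(t) \le t$ for all $t$. Inductively, for $\tilde\sigma_k' \le t < \tilde\sigma_{k+1}'$ define $\tau(t)$ analogously from the conditional
interarrival distributions (recall that $\tilde\tau_k$ and $\tilde\tau_k'$ are coupled); then as above we have $\tau(t) \le t$ $\tilde{\mathbb{P}}$-a.s. and $\tau(\cdot)$ is strictly increasing.
To conclude, observe that the performance of any given control $\xi'$ with respect to $N^{\pi'}$ is the same as
the performance of the control $\xi$ with respect to $N^{\pi}$ defined by $\xi(t) := \xi'(\tau^{-1}(t))$, $\xi(t) := \xi(\tau(T))$ for
$\tau(T) \le t \le T$, since $\tilde{\mathbb{P}}(\tilde\sigma_k' \le t) = \tilde{\mathbb{P}}(\tilde\sigma_k \le \tau(t))$ for all $k, t$. Thus, $v^{\xi'}(k,T,\pi') = v^{\xi}(k,T,\pi)$ and since $\xi'$
was arbitrary, $v(k,T,\pi') \ge v(k,T,\pi)$. □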

3.4. Continuous Sale Amounts. A related limiting model is obtained when we allow the sale amounts
to be arbitrary real numbers, rather than integers. The corresponding problem becomes

\[ u(x,T,\pi) = \inf_{\xi \in \mathcal{A}^c_x} \mathbb{E}^{\pi}\Big[ \sum_{i:\sigma_i \le T} F(\xi_{\sigma_i-} - \xi_{\sigma_i}) + F(\xi_T) \Big], \tag{3.18} \]

where $\mathcal{A}^c_x \supset \mathcal{A}_k$ is now the set of all $\mathbb{F}$-adapted, non-increasing processes whose values change only at
the jump times of the Poisson process $N$, with $\xi_0 = x$. The value function when continuous sales are
allowed is easier to work with. For example, we can easily derive the following result.

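Lemma 3.4. $u(x,T,\pi)$ is convex in $x$.

Proof. The proof is immediate once one notes that the set of admissible strategies is convex (which was
not true under integer constraints). Thus, denote by $\xi_i$ an $\epsilon$-optimal strategy for $u(x_i,t,\pi)$,
$i = 1,2$. Fix $0 < \lambda < 1$. Then $\xi := \lambda \xi_1 + (1-\lambda)\xi_2$ is an admissible strategy for $u(\lambda x_1 + (1-\lambda) x_2, t, \pi)$
since it will sell $\lambda x_1$ units using $\xi_1$ and the remaining $(1-\lambda) x_2$ units using $\xi_2$. Finally,

\begin{align*}
u(\lambda x_1 + (1-\lambda) x_2, t, \pi) &\le u^{\xi}(\lambda x_1 + (1-\lambda) x_2, t, \pi) \\
&= \mathbb{E}^{\pi}\Big[ \sum_{i} F\big( \lambda(\xi_1(\sigma_i-) - \xi_1(\sigma_i)) + (1-\lambda)(\xi_2(\sigma_i-) - \xi_2(\sigma_i)) \big) + F\big( \lambda \xi_1(T) + (1-\lambda)\xi_2(T) \big) \Big] \\
&\le \mathbb{E}^{\pi}\Big[ \sum_{i} \big\{ \lambda F(\xi_1(\sigma_i-) - \xi_1(\sigma_i)) + (1-\lambda) F(\xi_2(\sigma_i-) - \xi_2(\sigma_i)) \big\} + \lambda F(\xi_1(T)) + (1-\lambda) F(\xi_2(T)) \Big] \\
&\le \lambda u(x_1,t,\pi) + \epsilon + (1-\lambda) u(x_2,t,\pi) + \epsilon,
\end{align*}

where the penultimate line follows by the convexity of $F(\cdot)$. Since $\epsilon$ was arbitrary the result follows. □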

The value function u satisfies a scaling property whenever F does. This helps to reduce the dimension 
of the problem. 
Lemma 3.5. Let us suppose that the depth function $F$ admits the following scaling property: $F(x\beta)/F(\beta) = H(x)$
for some function $H$ and all $\beta > 0$. Then $u(x,T,\pi) = H(x)\,\bar u(T,\pi)$, in which $\bar u$ is the unique
solution of

\[ \bar u(T,\pi) = \mathbb{E}^{\pi}\Big[ H(1) 1_{\{\sigma_1 > T\}} + \min_{a \in [0,1]} \big( H(a) + H(1-a) \cdot \bar u(T-\sigma_1, \Pi_{\sigma_1}) \big) 1_{\{\sigma_1 \le T\}} \Big]. \tag{3.19} \]

Proof. Using (3.18) and the assumption on $F$ we can see that $u(x,T,\pi) = H(x)\,u(1,T,\pi)$, since if $\xi$
is a strategy for $u(x,T,\pi)$ then $\xi/x$ is a strategy for $u(1,T,\pi)$. With the latter scaling property, the
dynamic programming equation (3.19) is just the counterpart of the original (2.1). □

Lemma 3.5 leads to the following result which helps us to compute the optimal action directly in the 
continuous-quantity formulation of the original (1.1). 

Corollary 3.6. Let us assume that $F(x) = x^{\gamma}$ (i.e. $H(x) = x^{\gamma}$), $\gamma > 1$. In the framework of the
original model (1.1), the function $\bar u$ in Lemma 3.5 satisfies the following non-linear ordinary differential
equation (ODE):

\[ \partial_T \bar u(T) = \lambda \bar u(T) \Big( \frac{1}{[1 + \bar u(T)^{1/(\gamma-1)}]^{\gamma-1}} - 1 \Big), \qquad \bar u(0) = 1. \tag{3.20} \]

Moreover, the optimal action in (3.19) solves

\[ \partial_T a(T) = \frac{\lambda}{\gamma-1} \, a(T)(1-a(T)) \big( (1-a(T))^{\gamma-1} - 1 \big) \le 0, \qquad a(0) = 1/2, \tag{3.21} \]

and $T \mapsto a(T)$ is convex.

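Proof. First, the dynamic programming equation (3.19) leads to the integral equation (note $H(1) = 1$)

\begin{align*}
\bar u(T) &= e^{-\lambda T} + \int_0^T \min_{a \in [0,1]} \big( a^{\gamma} + (1-a)^{\gamma} \bar u(T-s) \big) \lambda e^{-\lambda s} \, ds \\
&= e^{-\lambda T} \Big( 1 + \int_0^T \min_{a \in [0,1]} \big( a^{\gamma} + (1-a)^{\gamma} \bar u(s) \big) \lambda e^{\lambda s} \, ds \Big).
\end{align*}

The optimal action evidently satisfies

\[ a(T) = \frac{\bar u(T)^{1/(\gamma-1)}}{1 + \bar u(T)^{1/(\gamma-1)}}. \tag{3.22} \]

If we let $f(T) = e^{\lambda T} \bar u(T)$, it can be shown that

\[ \partial_T f(T) = \frac{\lambda f(T)}{\big[ 1 + (f(T) e^{-\lambda T})^{1/(\gamma-1)} \big]^{\gamma-1}}, \]

from which we can derive the ODE for $\bar u$ in (3.20). Finally, we obtain (3.21) for $a$ using (3.22) and the
ODE for $\bar u$. Since $a(T) \le 1/2$, by inspection the right-hand side of (3.21) is negative and it can also be
shown that $\partial^2_T a(T) \ge 0$. □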
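
As a numerical cross-check of Corollary 3.6, the two ODEs (3.20) and (3.21) can be integrated independently and compared through the algebraic relation (3.22). The sketch below is illustrative only: it uses a classical fourth-order Runge-Kutta step, and the function name `solve_power_depth`, the step size and the parameter defaults are assumptions rather than anything prescribed by the paper.

```python
def solve_power_depth(T_max, lam=1.0, gamma=2.0, dt=1e-3):
    """Integrate (3.20) for u_bar and (3.21) for a with classical RK4 steps.
    Returns the lists of u_bar and a on the grid 0, dt, 2 dt, ..., T_max."""
    p = 1.0 / (gamma - 1.0)

    def du(u):  # right-hand side of (3.20)
        return lam * u * ((1.0 + u ** p) ** (1.0 - gamma) - 1.0)

    def da(a):  # right-hand side of (3.21)
        return lam / (gamma - 1.0) * a * (1.0 - a) * ((1.0 - a) ** (gamma - 1.0) - 1.0)

    def rk4(f, y):
        k1 = f(y)
        k2 = f(y + 0.5 * dt * k1)
        k3 = f(y + 0.5 * dt * k2)
        k4 = f(y + dt * k3)
        return y + dt * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0

    u, a = 1.0, 0.5  # initial conditions u_bar(0) = 1, a(0) = 1/2
    us, acts = [u], [a]
    for _ in range(int(round(T_max / dt))):
        u, a = rk4(du, u), rk4(da, a)
        us.append(u)
        acts.append(a)
    return us, acts


if __name__ == "__main__":
    us, acts = solve_power_depth(1.0, gamma=2.0)
    c = us[-1]  # for gamma = 2 the exponent 1/(gamma-1) equals 1
    print("a(1) from (3.21):", acts[-1], " from (3.22):", c / (1.0 + c))
```

The two values printed at the end should agree up to the discretization error of the integrator, which confirms that (3.21) is consistent with (3.20) and (3.22).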
We find that for a power depth function, Corollary 3.6 provides an excellent approximation even for
moderate values $k > 20$. Thus, when the scaling property of $F$ is satisfied, we obtain a very fast method
to compute $v(k,T) \simeq H(k)\,\bar u(T)$ and $a(k,T) \simeq k \cdot a(T)$, with $\bar u$ and $a$ as defined in (3.20) and (3.21).

4. Numerical Illustrations 

In this section we illustrate the results of our analysis with some computational examples.

We begin with the base model where we take without loss of generality $\lambda = 1$. We also take a
quadratic depth function $F(a) = a^2/2$. Solving for $a(k,T)$ using Remark 2.1 we obtain Figure 1. As
shown in Lemma 2.11, $a(k,\cdot)$ decreases by steps of size 1; at the same time, as shown by Corollary 2.6,
$a(\cdot,T)$ increases by steps of size 1. This surface is used in conjunction with (2.5) to react to the arrivals
of orders in an optimal way.




Figure 1. Optimal sale amounts $a(k,T)$ as a function of current holding $k$ (axis: remaining shares $k$) and
time to maturity $T$ (axis: time to maturity $T$). We take $\lambda = 1$, $F(a) = a^2/2$ and the model (1.1).



We then proceed to study the more complex extensions of Section 3. Thus, we assume that several
liquidity regimes are possible; to be concrete, we fix the liquidity regime-switching model as
$M_t \in E = \{\text{High}, \text{Med}, \text{Low}\} = \{1,2,3\}$ with infinitesimal generator



Q 



(-2 
1 



Note that $M$ is recurrent. The intensity of orders is $\lambda(M_t)$ with $\lambda = [3,3,1]$ and order sizes have the
strictly positive Poisson distributions $\nu_j(y) = \frac{e^{-\mu_j}}{1 - \exp(-\mu_j)} \frac{\mu_j^y}{y!}$, $y = 1,2,\ldots$, with mean sizes $\mu = [8,4,4]$.
The observed order flow is therefore frequent and of large size in the "High" liquidity regime, frequent
but of small size in the "Med" regime, and infrequent and of small size in the "Low" regime.

In the case of full observations, $v(k,T;i)$ is easily computed by solving the corresponding system
of ODEs in (3.5). In this context, Figure 2 shows the effect of constraints on the optimal strategy and
optimal execution cost. We observe that constraints play the largest role at medium time horizons, as
on long time horizons the agent has plenty of opportunities to trade, while with very short deadlines
the convexity of $F$ is the determining factor. Also, as expected, the agent responds to constraints by
preemptively placing marginally larger orders in the hope that they will be filled.




Figure 2. The effect of constraints in the regime-switching setting of Section 3.2. Left
panel shows the difference between $\tilde v(k,T;i)$ and $v(k,T;i)$ for a fixed $i = 1$ and $k = 20$;
right panel plots the difference between $\tilde a(k,T;i)$ and $a(k,T;i)$ for the same $i = 1$, $k = 20$.
The horizontal axis in both panels is the time to maturity $T$, ranging from 0.2 to 2.

4.1. Partial Observations. With the partially-observed setting of Section 3.3, the strategies are more
complex, as they now depend on the dynamic beliefs $\Pi(\cdot)$. Numerically, we compute $v(k,T,\pi)$ and
$a(k,T,\pi)$ by solving (3.9) on a discrete mesh approximation of $D = \{\pi_1 + \pi_2 + \pi_3 = 1, \pi_j \ge 0\}$ and
a discrete time grid with $\Delta t = 0.01$. The action of the operator $S_i$ in (3.14) is obtained by a linear
interpolation. Figure 3 shows optimal trading amounts $a(k,T,\cdot)$ for several different horizons $T$ and
initial holding of $k = 20$ shares. As expected, as more time is available till the close, optimal order
size decreases. We also see that the beliefs of the trader play an important role; in particular when the
likelihood of being in the "Low" frequency regime is large (bottom right corner), the trader will seek to
place larger trades.

Figure 3. Optimal sale amounts $a(k,T,\pi)$ for different times to maturity and initial
holding of $k = 20$ shares for the model of Section 4.1. Left panel: $T = 0.25$; middle
panel: $T = 0.45$; right panel: $T = 1$. The triangular regions represent the simplex
$D = \{\pi_1 + \pi_2 + \pi_3 = 1, \pi_j \ge 0\}$ of the agent's beliefs.

To compare the different models of Section 3, Table 1 presents a summary of the various value
functions. Namely, we compare the effect of partial observations, and also of constraints. Finally, we
also show the accuracy of the upper and lower bounds of Lemmas 2.1 and 2.2 for this case. We see that
these bounds are quite tight (relative difference of about 10-15%) and can be used to give a quick idea
about $v$. The bounds are easily computed via a Monte Carlo simulation: one first simulates paths of
the continuous-time Markov chain $M$ and then, conditional on such a path, simulates $N(T)$ using the
fact that if $M_s = j$ for $s \in [T_1, T_2]$ then $N(T_2) - N(T_1) \sim \mathrm{Poisson}(\lambda_j (T_2 - T_1))$.
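
The two-stage simulation just described can be sketched as follows. This is a minimal illustration under explicit assumptions: the generator used in the usage example is hypothetical, the Poisson count on each segment is drawn by accumulating exponential interarrival times (the Python standard library has no Poisson sampler), and the function names are not from the paper. The formulas of Lemmas 2.1 and 2.2 that turn samples of $N(T)$ into bounds are not reproduced here.

```python
import random


def simulate_chain(Q, i0, T, rng):
    """Sample a path of M on [0, T] as a list of (start_time, state) pairs."""
    t, i = 0.0, i0
    path = [(0.0, i0)]
    while True:
        rate = -Q[i][i]              # total exit rate from state i
        if rate <= 0.0:              # absorbing state: no further transitions
            break
        t += rng.expovariate(rate)
        if t >= T:
            break
        r, acc, nxt = rng.random() * rate, 0.0, i
        for j, q in enumerate(Q[i]):
            if j == i or q <= 0.0:
                continue
            nxt, acc = j, acc + q    # keep the last admissible state as a fallback
            if r < acc:
                break
        i = nxt
        path.append((t, i))
    return path


def poisson_count(rate, duration, rng):
    """Number of arrivals of a rate-`rate` Poisson process during `duration`."""
    if rate <= 0.0:
        return 0
    n, t = 0, rng.expovariate(rate)
    while t < duration:
        n += 1
        t += rng.expovariate(rate)
    return n


def simulate_N_T(Q, lam, i0, T, rng):
    """Draw N(T): first a path of M, then independent Poisson counts per segment."""
    path = simulate_chain(Q, i0, T, rng)
    ends = [s for s, _ in path[1:]] + [T]
    return sum(poisson_count(lam[i], e - s, rng) for (s, i), e in zip(path, ends))


if __name__ == "__main__":
    rng = random.Random(2024)
    Q = [[-1.0, 1.0], [1.0, -1.0]]   # hypothetical generator, for illustration only
    draws = [simulate_N_T(Q, [3.0, 1.0], 0, 1.0, rng) for _ in range(10000)]
    print("estimated E[N(1)] =", sum(draws) / len(draws))
```

For the symmetric two-state generator of the usage example, the exact mean is $\int_0^1 (2 + e^{-2s})\,ds = 2 + (1 - e^{-2})/2 \approx 2.43$, which the estimate should reproduce up to Monte Carlo error.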

The comparison between e.g. $v(k,T;1)$ and $v(k,T,(1,0,0))$ is justified since in both cases the initial
system state is the same (namely $M_0 = 1$ $\mathbb{P}$-a.s.) and therefore the distribution of possible $N$-realizations
is identical. This is also the reason why the lower and upper bounds are the same for the fully observed
and partially observed models. Thus, $v(k,T,(1,0,0)) - v(k,T;1)$ directly measures the effect of partial
information on the optimal execution cost. We find that in the unconstrained case, the effect of partial
observations is mild and on the order of 1-2%. In the given example it is highest in regime 2, which is
the "junction point" between the favorable "High" liquidity regime 1 and the "Low" liquidity regime
3. The addition of constraints accentuates the information loss from not observing $M$ since knowledge
of $M$ becomes more valuable. Thus, the differences between the partial- and full-observation models
are now on the order of 4-5% in Table 1. Since the formulas in Lemmas 2.1 and 2.2 are for the base
case without constraints, the constrained value functions $\tilde v$ are typically larger than the upper bound
$\bar v$. One could compute an adjusted $\bar v$ that takes into account constraints, but no simple formulas like in
Lemma 2.2 appear to be forthcoming.

Fully Observed Regime Switching

Initial regime $i$    $\underline{v}(k,T;i)$    $v(k,T;i)$    $\tilde v(k,T;i)$    $\bar v(k,T;i)$
1                     73.16                     77.80         83.54                83.31
2                     84.26                     88.36         98.97                93.50
3                     98.94                     102.22        114.25               107.11

Partially Observed Regime Switching

$\Pi_0 = \pi$         $\underline{v}(k,T,\pi)$  $v(k,T,\pi)$  $\tilde v(k,T,\pi)$  $\bar v(k,T,\pi)$
(1,0,0)               73.16                     78.70         86.20                83.31
(0,1,0)               84.26                     90.49         103.14               93.50
(0,0,1)               98.94                     103.03        119.05               107.11
(1/3,1/3,1/3)         85.46                     89.21         102.73               94.64

Table 1. We consider the regime-switching case with $T = 1$, $k = 20$, $F(a) = \ldots$
The lower bounds $\underline{v}$ are computed using Lemma 2.1 and the upper bound $\bar v$ is computed
using Lemma 2.2. Note that these bounds are the same for fully-observed and partially-observed
settings. We also compare the constrained $\tilde v$ to the basic $v$.

5. Conclusion

In this paper we have proposed a new model for studying the optimal trade execution problem in
financial markets. Our model is directly based on a discrete order flow and is therefore especially suited
to capture the features of trading in dark pools, where orders are executed only when matched with a
crossing counterparty.

To simplify our presentation, our analysis assumed a simple compound Poisson representation of the 
order flow. However, the obtained dynamic programming equations and most of the stylized properties 
of the value function and optimal strategy are expected to hold in much more general setups. These 
could include time-dependent parameters (such as price impact, order intensity and size distribution) 
or further constraints on optimal strategy. 

Realistic dark pool trading involves simultaneous execution on several exchanges. In particular, the 
trader will place trades both in the dark pool and on the regular limit order book in order to optimize 
the trade-off between liquidity, minimal price impact and information content (dark pool prices are 
often delayed compared to the limit book). In the case where the order flows of different exchanges 
are independent, the problem still fits into our framework, since superposition of independent Poisson 
processes is another Poisson process. The only modification is that orders will now carry the tag of the 
associated exchange and therefore the depth function $F$ will depend on order type. More complicated
multiple-venue problems can be addressed by considering a multi-dimensional version of our model and
will be taken up in future work.